Prosodic Cues for Automatic Word Boundary Detection in ASR

نویسندگان

  • Klara VICSI
  • György SZASZÁK
  • Philippe Langlais
چکیده

This article presents a cross-lingual study for agglutinative, fixed stressed languages, like Hungarian and Finnish, about the segmentation of continuous speech on word level by examination of supra-segmental parameters. We have developed different algorithms based either on a rule based or a datadriven approach. The best results were obtained by data-driven algorithms (HMMbased methods) using the time series of fundamental frequency and energy together. This HMM based method will be described in this article. Word boundaries were marked with acceptable accuracy, even if we were unable to find all of them. On the base of this study a word level segmentationer has been developed which can indicate the word boundaries with acceptable precision for both languages. The evaluated method is easily adaptable to other fixed-stress languages.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prosodic Trees for Boundary Detection in ASR in French

Prosodic trees as a hierarchical representation of prosodic organization in French proved to be efficient for automatic processing of continuous speech. We applied this technique to the prosodic boundary detection on the output of a speech recognition application in order to test whether prosodic boundaries of different levels in tree confirm or not recognition hypotheses. Two types of tree con...

متن کامل

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

Automatic Detection of Emphasized Words for Performance Enhancement of a Czech ASR System

This paper deals with a problem of prosodically emphasized word detection in Czech speech. The main goal is to propose an automatic emphasized word detection system that would be component of an Automatic speech recognition system (ASR) and would enrich its text output with highlighting emphasized words. The detection method is based on Czech prosodic rules and uses speech signal intensity, pit...

متن کامل

Using Prosody for Automatic Sentence Segmentation of Multi-party Meetings

We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We repor...

متن کامل

How far can prosodic cues help in word segmentation?

Prosodic cues are of great importance in parsing speech signal into prosodic and lexical units. Listeners detect the changes of the prosodic parameters and interpret them to detect sentence modalities or the mood of the speaker. Some automatic speech recognition systems try to use prosodic parameters to detect boundaries of prosodic units and help thus the acoustic decoding process. Although th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006